Swivel: Improving Embeddings by Noticing What's Missing
Abstract
We present Submatrix-wise Vector Embedding Learner (Swivel), a method for generating low-dimensional feature embeddings from a feature co-occurrence matrix. Swivel performs approximate factorization of the point-wise mutual information matrix via stochastic gradient descent. It uses a piecewise loss with special handling for unobserved co-occurrences, and thus makes use of all the information in the matrix. While this requires computation proportional to the size of the entire matrix, we make use of vectorized multiplication to process thousands of rows and columns at once to compute millions of predicted values. Furthermore, we partition the matrix into shards in order to parallelize the computation across many nodes. This approach results in more accurate embeddings than can be achieved with methods that consider only observed co-occurrences, and can scale to much larger corpora than can be handled with sampling methods.
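The idea in the abstract can be illustrated with a minimal NumPy sketch of a piecewise objective evaluated shard by shard. The abstract does not give the loss formulas, so everything specific here is an assumption made for illustration: the squared-error term weighted by the square root of the count for observed cells, the softplus penalty for unobserved cells, the choice to score unobserved cells as if their count were 1, and the function names `pmi_matrix`, `shard_loss`, and `total_loss`. Only the objective is shown; the stochastic gradient descent update and the distribution across nodes are omitted.

```python
import numpy as np

def pmi_matrix(counts):
    """Point-wise mutual information for every cell of a co-occurrence matrix.

    Assumption: unobserved cells (count == 0) are scored as if the count
    were 1, so their PMI is finite. Every row and column must have at
    least one non-zero count.
    """
    total = counts.sum()
    row = counts.sum(axis=1, keepdims=True)
    col = counts.sum(axis=0, keepdims=True)
    smoothed = np.where(counts > 0, counts, 1.0)
    return np.log(smoothed) + np.log(total) - np.log(row) - np.log(col)

def shard_loss(W, C, counts, pmi):
    """Piecewise loss on one shard (hypothetical form, see lead-in)."""
    pred = W @ C.T                  # one matrix multiply predicts every cell
    err = pred - pmi
    squared = 0.5 * np.sqrt(counts) * err ** 2   # observed cells
    softplus = np.logaddexp(0.0, err)            # unobserved cells: log(1 + e^err)
    return np.where(counts > 0, squared, softplus).sum()

def total_loss(W, C, counts, block):
    """Sum the shard losses over a grid of block x block submatrices."""
    pmi = pmi_matrix(counts)
    loss = 0.0
    for r in range(0, counts.shape[0], block):
        for c in range(0, counts.shape[1], block):
            rs, cs = slice(r, r + block), slice(c, c + block)
            loss += shard_loss(W[rs], C[cs], counts[rs, cs], pmi[rs, cs])
    return loss

# Toy data: 4 row features, 4 column features, 3-dimensional embeddings.
rng = np.random.default_rng(0)
counts = np.array([[5., 0., 1., 0.],
                   [0., 3., 0., 2.],
                   [1., 0., 4., 0.],
                   [0., 2., 0., 6.]])
W = rng.normal(scale=0.1, size=(4, 3))   # row embeddings
C = rng.normal(scale=0.1, size=(4, 3))   # column embeddings
print(total_loss(W, C, counts, block=2))
```

Because each shard depends only on its own slice of the row embeddings, column embeddings, and counts, the shard losses are independent terms of the total, which is what would allow them to be computed on separate workers.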
Similar resources
What's in an Embedding? Analyzing Word Embeddings through Multilingual Evaluation
In the last two years, there has been a surge of word embedding algorithms and research on them. However, evaluation has mostly been carried out on a narrow set of tasks, mainly word similarity/relatedness and word relation similarity and on a single language, namely English. We propose an approach to evaluate embeddings on a variety of languages that also yields insights into the structure of ...
Automated Detection of Non-Relevant Posts on the Russian Imageboard "2ch": Importance of the Choice of Word Representations
This study considers the problem of automated detection of non-relevant posts on Web forums and discusses the approach of resolving this problem by approximating it with the task of detecting semantic relatedness between the given post and the opening post of the forum discussion thread. The approximated task could be resolved through learning the supervised classifier with a composed word e...
A small swivel joint for infusion of free moving animals.
Construction and application of a small, light, inexpensive swivel joint suitable for infusions of small laboratory animals is described. Commercially available tubes and needles are used in the construction of the swivel, thus making it easy to prepare and essentially disposable. This swivel is especially useful in long- and short-term infusions of pharmaca and nutrients in the general circulati...
DNA swivel enzyme activity in a nuclear membrane fraction
DNA swivel (nicking-rejoining) enzyme activity has been studied in various cell fractions of a human lymphoid cell line. Swivel activity is found only in chromatin and in a nuclear membrane fraction containing DNA and possessing endogenous DNA synthesizing activity. Twenty percent of the total swivel activity and less than one percent of the total DNA are in the membrane fraction. The swivel en...
Ensemble cryo-EM uncovers inchworm-like translocation of a viral IRES through the ribosome
Internal ribosome entry sites (IRESs) mediate cap-independent translation of viral mRNAs. Using electron cryo-microscopy of a single specimen, we present five ribosome structures formed with the Taura syndrome virus IRES and translocase eEF2•GTP bound with sordarin. The structures suggest a trajectory of IRES translocation, required for translation initiation, and provide an unprecedented view ...
Journal title:
- CoRR
Volume abs/1602.02215
Pages -
Publication date 2016